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JUL 2 0 2006 

This listing of claims will replace all prior versions of claims in the application: 
Listing of Claims: 

1 . (Currently Amended) A system that facilitates spell checking, comprising: 
a component that receives input data containing text; and 

a spell checking component that identifies a set of potentially misspelled substrings in the 
text and proposes at least one alternative spelling for the substring set based on at least one query 
log; the query log comprising data utilized by users to query a data collection over a time frame^ 
the spell checking component utilizes substring occurrence and co-occurrence statistics from the 
at least one cme rv log, the substring co-occurrence statistics comprising substring bieram counts 
with stop-word-seouence-skipping counts. 

2. (Original) The system of claim 1, the spell checking component further utilizes user- 
dependent information in proposing at least one alternative spelling. 

3. (Currently Amended) The system of claim 1, the alternative spelling for the substring set 
is further based on at least one trusted lexico n; th e trust e d l e xicon comprising at l e ast on e 
selected from the group consisting of a trusted lexicon with content and a trusted loxioon without 

PPHBBl IL» 

4. (Currently Amended) The system of claim 3, the spell checking component further 
employs a list of stop word s; tho Hot of otop wordc comprising - at least ono s e l e ct e d from th e 
group - Goacisting of a liat of atop wordo with content - and a list of - stop words without content . 

5. (Currently Amended) The system of claim 4, the list of stop words with cont e nt 
comprising a Gtop word list containing high frequency words and function words and their 
frequent misspellings. 
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6. (Original) The system of claim 4, the spell checking component employs an iterative 
process to search a space of alternative spellings. 

7. (Original) The system of claim 6, the spell checking component employs, at least in part, 
heuristics to impose restrictions on a search space utilized to determine a proposed alternative 
spelling- 
s' (Original) The system of claim 7, the heuristics utilize, at least in part; at least one fringe 
to limit the search space. 

9. (Original) The system of claim 4, the query log comprising a histogram of queries asked 
over a time frame. 

10. (Original) The system of claim 9, the histogram of asked queries relates to a subset of the 
users; the subset comprising at least one user: 

1 1 . (Original) The system of claim 9 a the query log resides on a server computer. 

12. (Original) The system of claim 9, the query log resides on a client computer, 

13. (Cancelled). 

14. (Currently Amended) The system of claim l[[3]] 3 a substring comprising at least one 
selected from the group consisting of an entry in at least one trusted lexicon, an entry in a stop 
word list, and a sequence of characters without a pre-defined set of delimiter characters. 

15. (Currently Amended) The system of claim 1 [[3]], th e substring oo ooowrr e nc e statistics 
comprising substring bigrom oounto; a substring bigram comprising a pair of substrings in a text. 

16. (Original) The system of claim 15, the substring bigram comprising a pair of adjacent 
substrings in a text 
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17. (Cancelled). 

1 8. (Currently Amended) The system of claim 1[[3]], the substring occurrence and co- 
occurrence statistics from the query log are stored in a same searchable data structure. 

19. (Original) The system of claim 1 8, the data structure comprising a trie. 

20. (Original) The system of claim 18 handles concatenated and/or split substrings in a same 
manner as it handles individual substrings. 

21. (Original) The system of claim 20, the spell checking component generates a set of 
alternative spellings that are substrings in at least one selected from the group consisting of at 
least one query log and at least one lexicon. 

22. (Original) The system of claim 2 1 , the set of alternative spellings comprising a set of 
alternative spellings determined via an iterative correction process. 

23. (Original) The system of claim 22, the iterative correction process comprising a plurality 
of iterations that change at least one substring to another substring as an alternative spelling; the 
iterative correction process halts when all possible alternative spellings are less appropriate than 
a current set of alternative spellings. 

24. (Original) The system of claim 23, the alternative spellings and their appropriateness are 
computed based on a probabilistic string distance and a statistical context model. 

25. (Original) The system of claim 24, the probabilistic string distance comprising a modified 
context-dependent weighted Damerau-Levenshtein edit function that allows insertion, deletion, 
substitution, transposition, and long-distance movement of characters as point changes. 
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26. (Original) The system of claim 24, in each iteration, the set of alternative spellings for a 
substring is generated utilizing a searchable substring data structure extracted from at least one 
query log and at least one trusted lexicon. 

27. (Original) The system of claim 26, in each iteration, the set of alternative spellings for 
each substring is restricted to within a probabilistic distance 5 from an input substring; the 
restriction is imposed within each iteration without limiting the iterative correction process as a 
whole. 

28. (Original) The system of claim 27, in each iteration, the iterative correction process 
searches for an optimum set of alternative spellings via utilization of a statistical context model. 

29. (Original) The system of claim 28, the statistical context model comprising substring 
occurrence and co-occuirence statistics extracted from at least one query log. 

30. (Original) The system of claim 29, a Viterbi search is employed to facilitate in 
determining the optimum set of alternative spellings according to the context model in each 
iteration, 

3 1 . (Original) The system of claim 30, the Viterbi search can employ fringes to restrict a 
search for alternative spellings in an iteration such that for every parr of adjacent substrings, if 
any of the substrings is in at least one trusted lexicon, then only one of the substrings is allowed 
to change in that iteration. 

32. (Currently Amended) A method of facilitating spell checking, comprising: 
receiving input data containing text; 

identifying a set of potentially misspelled substrings in the text; ead 
generating a set of alternative spellings that are substrings in at least one selected from 
the group consisting of at least one query log comprising data utilized bv users to query a data 
collection over a time frame and at least one lexicon, the set of alternative spellings comprising a 
set of alternative spellings determined via an iterative correction process that includes searching 

• 
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for an optimum set of alternative spellings via utilization of a statistical context model: the 
statistical context model comprising substring occurrence and co-occurrence statistics extracted 
from the at least one query log; 

employing a Viterbi search to facilitate in d etermining the optimum set of alternative 
spellings according to the context mode l in each iteration: the Viterbi search can employ frin£es 
to restrict a search for alternative spellings in an iteration such t hat for every pair of adjacent 
substrings, if anv of the substrings is in at least one trusted lexicon, then o nly one of the 
substrings is allowed to change in that iteration: and 

proposing at least one alternative spelling for the substring set based on at least on e qu e ry 
log; tho query log comprising^ata utilized by users to qu e ry a data oollootion ovor a tim e fram e. 

33. (Cancelled). 

34. (Currently Amended) The method of claim 32 [[33]], further comprising: 
employing, at least m part, a list of stop words to facilitate in deternuning at least one 

alternative spellin g; th e list of stop words comprising at looat on e sel e cted from th e group 
consisting of a liot of atop words with cont e nt and a list of Gtop words without oontont ; 

utilizing substring occurrence and co-occurrence statistics from at least one query log; the 
query log comprising a histogram of queries asked over a time frame and the substring 
occurrence and co-occurrence statistics from the query log are stored in a same searchable data 
structure: and 

handling concatenated and/or split substrings in a same manner as handling individual 
substrings; aad 

generating a g e t of alt e rnativ e spollinga - tiiat are substrings in at least on e selected from 
fe o - group consisting of at l e ast on e qu e sy log and at least ono lexicon, the set of alternative 
spellings comprising a sot of alternative spellings determined via an it e rativ e corr e ction process . 

35. (Original) The method of claim 34, the iterative correction process comprising: 
changing at least one substring to another substring as an alternative spelling; and 
halting the iterative correction process when all possible alternative spellings are less 

appropriate than a current set of alternative spellings; the alternative spellings and their 
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appropriateness are computed based on a probabilistic string distance and a statistical context 
model. 

36. (Currently Amended) The method of claim 35, further comprising in each iteration of the 
iterative correction process: 

utilizing a searchable substring data structure extracted from at least one query log and at 
least one trusted lexicon to generate the set of alternative spellings for a substring; and 

restricting the set of alternative spellings for each substring to within a probabilistic 
distance 5 from an input substring; the restriction being imposed within each iteration without 
limiting the iterative correction process as a whole; aaad 

searching fe* on optimum s e t of alternative op e llings via utilization of a otatistical cont e xt 
model; th e statistical context mod e l comprising substring occurrence and co occurrenc e s tatist i c s 
extracted from at least ono query log . 

37. (Cancelled). 

38. (Currently Amended) A system that facilitates spell checking queries to a search engine, 
comprising: 

means for receiving input data containing text; and 

means for identifying a set of potentially misspelled substrings in the text and proposing 
at least one alternative spelling for the substring set based on at least one query log; the query log 
comprising data utilized by users to query a data collection over a time frame, the means for 
identifying a set of potentially misspelled substrings in the text utilizes substring occurrence and 
co-occurrence statistics from the at least one query log, the substring co-occurrence statistics 
comprising substring bigram counts with stop -word-sequence-skippin g counts: a substring 
bigram comprising a pair of substrings in a text . 

39. (Cancelled) 

40. (Cancelled) 



7 



PAGE 7/12 * RCVD AT 7(20/2006 5:54:16 PM [Eastern Daylight Time] ^ SVR:USPT0-EFXRF-3/1 1 * DNIS:2738300 * CSID:216 696 8731 * DURATION (mm-ss):0346 



'07/20/2006 16: 4S FAX 216 696 6731 
10/01,968 



AMIN, & TUROCY LLP, 



@008 



MS306752.01/MSFTP585US 



41 . (Currently Amended) A device employing the method of claim 32 comprising at least 
one oolootod from tho group conflicting of a computer, a server, and a handheld electronic device. 

i 

42. (Currently Amended) A device employing the system of claim 1 comprising at least one 
oolootod from th e group oonoiating of a computer, a server, and a handheld electronic device. 
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